Sequence and expression analysis of gaps in human chromosome 20

نویسندگان

  • Sheroy Minocherhomji
  • Stefan Seemann
  • Yuan Mang
  • Zahra El-schich
  • Mads Bak
  • Claus Hansen
  • Nickolas Papadopoulos
  • Knud Josefsen
  • Henrik Nielsen
  • Jan Gorodkin
  • Niels Tommerup
  • Asli Silahtaroglu
چکیده

The finished human genome-assemblies comprise several hundred un-sequenced euchromatic gaps, which may be rich in long polypurine/polypyrimidine stretches. Human chromosome 20 (chr 20) currently has three unfinished gaps remaining on its q-arm. All three gaps are within gene-dense regions and/or overlap disease-associated loci, including the DLGAP4 locus. In this study, we sequenced ∼ 99% of all three unfinished gaps on human chr 20, determined their complete genomic sizes and assessed epigenetic profiles using a combination of Sanger sequencing, mate pair paired-end high-throughput sequencing and chromatin, methylation and expression analyses. We found histone 3 trimethylated at Lysine 27 to be distributed across all three gaps in immortalized B-lymphocytes. In one gap, five novel CpG islands were predominantly hypermethylated in genomic DNA from peripheral blood lymphocytes and human cerebellum. One of these CpG islands was differentially methylated and paternally hypermethylated. We found all chr 20 gaps to comprise structured non-coding RNAs (ncRNAs) and to be conserved in primates. We verified expression for 13 candidate ncRNAs, some of which showed tissue specificity. Four ncRNAs expressed within the gap at DLGAP4 show elevated expression in the human brain. Our data suggest that unfinished human genome gaps are likely to comprise numerous functional elements.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

I-3: Human Y Chromosome Proteome Project 2012 Update

The Human Genome Project has generated a blueprint for the approximately 20,300 gene-encoded proteins potentially active in any of 230 cell types that make up the human body (human proteome). However, based on the UniProtKB/Swiss-Prot database content, about 6000 of at the protein level; for many others, there is very little information related to protein function, abundance, subcellular locali...

متن کامل

I-49: Human Y Chromosome ProteomeProject

The success of the Human Genome Project (HGP) has provided a blueprint for the approximately 20,000 gene-encoded proteins potentially active in all of the hundreds of cell types that make up the human body. Yet we still have limited knowledge about a majority of the gene-encoded proteins which are the “building blocks of life” and “cellular machinery”. It is estimated that for nearly half of th...

متن کامل

P-121: Cloning and Expression of The Inosine Triphosphate Pyrophosphatase Gene Variant II in E.coli

Background Environmental and cellular inappropriate conditions can cause damages to cells nucleotide poll. Deamination and oxidation damages interfere with cell�s vital reactions. Inosine triphosphate pyrophosphatase (ITPA), an evolutionary conserved enzyme, plays a critical role in elimination of non-canonical bases. In human genome, the ITPA gene is located on chromosome 20 short arm and tran...

متن کامل

Expression and Secretion of Human Granulocyte Macrophage-Colony Stimulating Factor Using Escherichia coli Enterotoxin I Signal Sequence

With the aim of the secretion of human granulocyte macrophage-colony stimulating factor (hGM-CSF) in Escherichia coli, hGM-CSF cDNA was fused in-frame next to the signal sequence of ST toxin (ST-I) of exteroxigenic E. coli, containing 53 or 19 amino acids of signal peptide. The fused STsig::hGM-CSF coding fragments were inserted into a T7-based expression plasmid. The recombinant plasmids were ...

متن کامل

I-39: Exploring New Frontiers in Human Y Chromosome Proteome Project

The major goal of the Chromosome-Centric Human Proteome Project (C-HPP) is to systematically map the entire human proteome with the intent to enhance our understanding of human biology at the cellular level. However, this goal may be hindered by the lack of quality observations of given proteins due to absence of expression in a given tissue, very low abundance, and expression only in rare samp...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره 40  شماره 

صفحات  -

تاریخ انتشار 2012